An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce

Authors

Abstract

Big Data Analytics is an innovative approach for extracting data from high-volume data warehouse systems. It provides a method to compress high volumes of data into clusters using MapReduce and HDFS. However, extracting and storing data in Hadoop clusters takes considerable processing time. The proposed system addresses the time delay in the shuffle phase of MapReduce caused by scheduling and sequencing. To improve the speed of big data processing, this work uses a Compressed Elastic Search Index (CESI) and a MapReduce-Based Next Generation Sequencing Approach (MRBNGSA). This approach helps increase the speed of data retrieval from HDFS because of the way the data is stored: only the metadata is stored, which takes comparatively less memory at runtime. It also reduces CPU utilization and the allocations made by the resource manager of the Hadoop framework and improves processing speed, so that the time delay is reduced to a minimum latency.
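The abstract locates the delay in the shuffle phase of MapReduce, the step between map and reduce where intermediate pairs are grouped by key. The following is a minimal in-memory sketch of the three phases using a word count, offered only to show where the shuffle sits. It is not the CESI or MRBNGSA method described in the paper, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def map_phase(records):
    """Emit a (word, 1) pair for every word in every input record."""
    for record in records:
        for word in record.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group all values by key.

    In this sketch the grouping happens in one process. On a real
    cluster this is the step that moves intermediate data between
    nodes, which is why it is sensitive to scheduling delays.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(groups):
    """Collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(records):
    """Run map, shuffle, and reduce in sequence over the records."""
    return reduce_phase(shuffle_phase(map_phase(records)))


if __name__ == "__main__":
    print(word_count(["big data", "Big cluster"]))
```

The reduce step cannot start on a key until the shuffle has delivered every value for that key, so any stall in the grouping step holds up everything after it.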

Similar resources

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....

Hadoop Mapreduce Framework in Big Data Analytics

Hadoop is a substantial-scale, open source programming system committed to adaptable, distributed, information-intensive processing. Hadoop [1] MapReduce is a programming structure for effectively composing applications which process vast amounts of information (multi-terabyte data sets) in parallel on large clusters (many nodes) of commodity hardware in a dependable, sho...

Application of Big Data Analytics in Power Distribution Network

Smart grid enhances optimization in generation, distribution and consumption of the electricity by integrating information and communication technologies into the grid. Today, utilities are moving towards smart grid applications, most common one being deployment of smart meters in advanced metering infrastructure, and the first technical challenge they face is the huge volume of data generated ...

Big Data Analytics: An Approach using Hadoop Distributed File System

Today’s world is driven by Growth and Innovation for a better future. All of which are based on analysis and harnessing of tons of data, typically known as Big Data. The tasks involved for achieving results at such a scale can be challenging and painfully slow. This paper works towards an approach for effectively solving a large and computationally intensive problem by leveraging the capabiliti...

A Review: Mapreduce and Spark for Big Data Analytics

In this paper we discuss the various challenges of Big Data and the problems that arise from the continuous explosion of data from social media and other online sources, as organizations seek deeper analysis of their data. This paper presents a comparison of Hadoop MapReduce and the recently introduced Apache Spark, both of which provide a processing model for analyzing big d...

Journal

Journal title: International Journal of System Dynamics Applications

Year: 2021

ISSN: 2160-9772, 2160-9799

DOI: https://doi.org/10.4018/ijsda.20211001.oa6